Search CORE

21 research outputs found

Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion

Author: A Abad
A Cardenal-Lopez
A Cardenal-López
A Jansen
A Jansen
A Martin
A Moreno
A Moreno
A Moreno-Sandoval
A Stolcke
Alejandro Coucheiro-Limeres
AM Azmi
Antonio Cardenal
Antonio Miguel
B Logan
B Logan
B Ma
B Taras
B Zhang
C Ni
C Parada
Carmen Garcia-Mateo
CJ Chen
D Can
D Karakos
D Povey
D Vergyri
D Vergyri
Doroteo T. Toledano
F Metze
F Metze
GJF Jones
H Joho
H Joho
H Su
H-Y Lee
H-Y Lee
HVD Heuvel
I Szöke
I Szöke
I-F Chen
I-F Chen
J Chiu
J Chiu
J Chiu
J Garofolo
J Li
J Mamou
J Mamou
J Pinto
J Tejedor
J Tejedor
J Trmal
J van Hout
Javier Tejedor
JG Fiscus
Julia Olcoz
Julian David Echeverry-Correa
K Iwata
K Thambiratmann
KM Knill
KM Knill
L Docío-Fernández
L Mangu
Laura Docio-Fernandez
LJ Rodríguez-Fuentes
M Bisani
M Cai
M Ma
M Saraclar
M Wollmer
M Zelenák
MJF Gales
MS Seigel
N Rajput
NF Chen
NF Chen
P Yu
Paula Lopez-Otero
R Justo
S Nakagawa
SP Rath
T Ng
T Ohno
T Sakai
V Mitra
V-B Le
X Anguera
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

The electronic version of this article is the complete one and can be found online at: http://dx.doi.org/10.1186/s13636-015-0063-8Spoken term detection (STD) aims at retrieving data from a speech repository given a textual representation of the search term. Nowadays, it is receiving much interest due to the large volume of multimedia information. STD differs from automatic speech recognition (ASR) in that ASR is interested in all the terms/words that appear in the speech data, whereas STD focuses on a selected list of search terms that must be detected within the speech data. This paper presents the systems submitted to the STD ALBAYZIN 2014 evaluation, held as a part of the ALBAYZIN 2014 evaluation campaign within the context of the IberSPEECH 2014 conference. This is the first STD evaluation that deals with Spanish language. The evaluation consists of retrieving the speech files that contain the search terms, indicating their start and end times within the appropriate speech file, along with a score value that reflects the confidence given to the detection of the search term. The evaluation is conducted on a Spanish spontaneous speech database, which comprises a set of talks from workshops and amounts to about 7 h of speech. We present the database, the evaluation metrics, the systems submitted to the evaluation, the results, and a detailed discussion. Four different research groups took part in the evaluation. Evaluation results show reasonable performance for moderate out-of-vocabulary term rate. This paper compares the systems submitted to the evaluation and makes a deep analysis based on some search term properties (term length, in-vocabulary/out-of-vocabulary terms, single-word/multi-word terms, and in-language/foreign terms).This work has been partly supported by project CMC-V2 (TEC2012-37585-C02-01) from the Spanish Ministry of Economy and Competitiveness. This research was also funded by the European Regional Development Fund, the Galician Regional Government (GRC2014/024, “Consolidation of Research Units: AtlantTIC Project” CN2012/160)

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Springer - Publisher Connector

Repositorio Universidad de Zaragoza

Biblos-e Archivo

Topic Tracking Using Chronological Term Ranking

Author: F Can
F Can
J Allan
JG Fiscus
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Diarizace telefonních hovorů Jazykové poradny Ústavu pro jazyk český

Author: JG Fiscus
M Senoussaoui
P Campr
P Kenny
S Ioffe
SH Shum
Z Zajíc
Z Zajíc
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

V tomto článku popisujeme diarizaci archivu Jazykové poradny vznikajícím v rámci projektu "Zpřístupnění dotazů jazykové poradny v lingvisticky strukturované databázi". Jedna část tohoto archivu je nahraná pouze v mono kvalit, naším úkolem je proto rozdělit data pomocí diarizace. Náš přístup využívá informace o identitě jazykového poradce získané z přepisu jeho představení na začátku každého z hovorů. Protože naše data jsou jedinenčná, pro porovnání uvádíme také výsledky dostupného systému diarizace Kaldi.In this paper, we describe a diarization of the archive data from the project “Access to a Linguistically Structured Database of Enquiries from the Language Consulting Center”. This project is attempting to provide improved access to the large archives of the Czech language of mainly telephone conversations collected continuously by The Language Consulting Center. One part of this archives contains mono recordings, where the data of the client and the language counsellor are mixed in one channel. In our proposed approach to a diarization, we used the information about the identity of the language counsellor acquired from the text transcription on the beginning of the conversation. For the initial stage of the diarization, our system based on clustering the x-vectors was adopted. The resegmentation step is used for refining the boundaries of speaker changes by the pre-trained Gaussian mixture model of the counsellor. Because of the uniqueness of our data, we compared our results with the Kaldi diarization as the baseline system

Crossref

University of West Bohemia Digital Library

DSpace at University of West Bohemia

Diarizace založená na identifikaci pomocí x-vektorů

Author: JG Fiscus
M Senoussaoui
P Campr
P Kenny
SH Shum
Z Zajic
Z Zajíc
Z Zajíc
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

V tomto článku popisujeme diarizaci mono telefonních dat z Jazykové poradny Ústavu pro jazyk český. Náš navrhovaný přístup k diarizaci využívá informace o identitě jednoho z účastníků hovoru. V klasickém přístupu k diarizaci nahrazujeme shlukování x-vektorů identifikací řečníka.In this paper, we describe a diarization of mono channel telephone recordings from The Language Consulting Center providing the Czech language consultancy service. In our proposed approach to a diarization, we use information about the known identity of one speaker (the language counsellor) acquired from the text transcription at the beginning of the conversation. In the state-of-the-art diarization based on the x-vectors clustering, we replace the clustering step by the identification of each segment of the recording against the counsellor’s identity x-vector and the general x-vector model that represents the client. Our proposed diarization without resegmentation step can be used as an online approach. Because of the uniqueness of our data, we compare our results with the Kaldi diarization as the baseline system

Crossref

University of West Bohemia Digital Library

DSpace at University of West Bohemia

Enhancing Labeled Data Using Unlabeled Data for Topic Tracking

Author: DM Blei
J Allan
JG Fiscus
K Markert
K Nigam
LM Manevitz
M Belkin
RE Schapire
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Neural regulation of cyclic AMP, cyclic AMP-dependent protein kinase, and phosphorylase in bullfrog ventricular myocardium.

Author: Brown JH
Fiscus RR
Hardman JG
Hayes JS
Hayes JS
Hess ME
Keely
Lefkowitz RJ
Mayer SE
Pindok MT
Tsien RW
Wastila WB
Publication venue: 'Ovid Technologies (Wolters Kluwer Health)'
Publication date
Field of study

Crossref

The public health approach to identify antiretroviral therapy failure: high-level nucleoside reverse transcriptase inhibitor resistance among Malawians failing first-line antiretroviral therapy

Author: Arribas
Bacheler Lee
Brenner
Coutsinos
Debbie Kamwendo
Ferradini
Gallant
Garcia-Lerma
Gulick
Harries
Huang
Isaakidis
Joep JG van Oosterhout
Johnson
Johnstone Kumwenda
Joseph J Eron
Julie AE Nelson
Kamya
Marcelin
Marconi
Margot
Mina C Hosseinipour
Neil Parkin
Parikh
Parikh
Parikh
Petropoulos
Phillips
Ralf Weigel
Sam Phiri
Sung
Sungkanuparph
Sungkanuparph
Susan A Fiscus
Publication venue: 'Ovid Technologies (Wolters Kluwer Health)'
Publication date
Field of study

Crossref